Percolation and Loop Statistics in Complex Networks 
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Complex networks display various types of percolation transitions. We show that the degree 
distribution and the degree-degree correlation alone are not sufficient to describe diverse percolation 
critical phenomena. This suggests that a genuine structural correlation is an essential ingredient 
in characterizing networks. As a signature of the correlation we investigate a scaling behavior in 
Mjv(/i), the number of finite loops of size h, with respect to a network size N. We find that networks, 
whose degree distributions are not too broad, fall into two classes exhibiting Mjv(ft) ~ (constant) 
and Mjv(/i) ~ (IniV) , respectively. This classification coincides with the one according to the 
percolation critical phenomena. 

PACS numbers: 89.75.Hc, 05.10.-a, 05.70.Fh, 05.50.+q 
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Introduction: Complex networks have been attract- 
ing much interest during the last decade. Structure, dy- 
namics, and collective phenomena have become intrigu- 
ing research subjects in network science [l|, 0, 0, HI ■ This 
work considers a structural correlation in complex net- 
works. Discussing scaling behaviors of percolation tran- 
sitions [!, i, 0, H U EO, [H E3, E, El , we will show that 
the structural correlation is an essential ingredient char- 
acterizing structural properties and collective phenomena 
of complex networks. 

Structural inhomogeneity is one of the most salient fea- 
tures of complex networks. It is reflected on the broad 
degree distribution pd 0] • For the study of the inhomo- 
geneity, the uncorrelated network has been considered. It 
is defined as an ensemble of networks specified only with 
a given degree distribution but random in any other as- 
pect. This includes the Erdos-Renyi random graph [j, 
the Molloy-Reed model [HI , and the static model lj| 17 1 . 
It is revealed that the structural inhomogeneity encoded 
in pd leads to rich collective phenomena [J] . 

Many real- world networks display some structural cor- 
relations. The degree- deg ree (DD) correlation has been 
considered mostly (1, Ha j. It refers to the correlation 
between degrees of nodes linked directly with an edge. 
Networks with a positive (negative) correlation are called 
to be assortative (disassortative). The DD correlation is 
represented with the two-point degree correlation func- 
tion p(d! , d) denoting the fraction of edges linking nodes 
of degree d and d 1 . With the DD correlation being incor- 
porated, the randomly correlated (RC) network has been 
considered. It is defined as an ensemble of networks spec- 
ified only with p(d!,d), but again random in any other 
aspect [E 0, Ill- 
St udying the RC network is a meaningful attempt to 
understand correlated networks. However, it remains un- 
known to what extent the RC network is a proper model 
for correlated networks. In this work we will show that 
there exists a class of networks with a genuine structural 
correlation that cannot be captured by the DD correla- 
tion. This will be shown by discussing percolation criti- 



cal phenomena in various networks. We will also suggest 
that number statistics of loops is useful as a signature of 
the correlation. 

Percolation: Consider a bond percolation problem 
with the occupation probability / in the RC network. 
It is specified with a degree correlation function p(d' , d). 
Other characteristics are easily represented with it [8j. 
For examples, qd = p{d' ,d) is the probability that 
an edge chosen randomly leads to a node of degree d. It 
is related to the degree distribution as qd — dpdj (d) with 
the mean degree (d). The conditional probability that 
the degree of a neighbor of a degree-d node is d' is given 
by p(d'\d) = p(d',d)/qd- The uncorrelated network has 
the property that p(d', d) = qd>qd or p(d'\d) = q^ ■ 

We summarize briefly the theory for the percolation in 
the RC network (see Refs. [1,0 for detail). The perco- 
lation order parameter P^, the fraction of nodes in the 
infinite cluster, is given by 



Poc = 1 - ^Pd (ud) d 



(1) 



where {ud} should satisfy the self-consistent equation 



Ud 



(2) 



Here Ud denotes the probability that an edge from a 
degree-d node leads to a finite cluster. It has a triv- 
ial solution {ud = 1} corresponding to P^ = 0. Hence 
the percolation threshold can be obtained from the linear 
stability analysis around the trivial solution. It leads to 
f c = 1/A max [1,0, where A max is the maximum eigen- 
value of the matrix C with elements 



C dd ' = (d' - l) P (d'\d) 



(3) 



it 



For the uncorrelated network having p{d'\d) = cu 
yields the well-known result f c = (d)/({d 2 ) -(d)) [6(. 

One can also find the mean cluster size 5, average size 
of finite clusters enclosing a node chosen randomly [1, . 
For the sake of simplicity, we consider only the region 



2 



f < fc where Ud = 1 for all d. The same conclusion can 
be drawn in the region / > f c , which is not presented 
here. It is given by 



S = 1 + ^ d p d v d 

d 



(4) 



where {vd} should satisfy the self-consistent equation 
Vd = f + f J2d' Cdd'Vd 1 - Using the matrix and vector 
notation, the solution is given by 



v= (I-/C)- 1 -/ 



(5) 



where v = (vi,V2, • ■ ■ ), / = (/,/,■■• ), and I is the iden- 
tity matrix. 

The onset of the percolation transition and the mean 
cluster size is determined with the matrix C. As one 
approaches the percolation threshold f c — 1/A max , the 
largest eigenvalue of /C approaches unity, the matrix 
(I — /C) _1 becomes singular, and v^'s become diver- 
gent [8|. Consequently, the mean cluster size S diverges 
as / — ► f c . This analysis shows that the percolation in 
the RC network as well as in the uncorrelated network 
should be accompanied with divergent S regardless of the 
shape of p(d', d). 

On the other hand, recent studies have revealed that 
some networks display percolation transitions which are 
associated with non-divergent mean cluster sizes. These 
are the growing network (GN) models [l(J EH, B El 
and the exponential random graph (ERG) model [14| . 
The GN model is defined through a growth rule. In the 
GN model of Callaway et al. [R|, for instance, a node is 
added each time step and an edge is added with the prob- 
ability S between a randomly selected pair of nodes. This 
model undergoes an infinite order percolation transition 
at 5 C = 1/8 and the mean cluster size S remains finite 
at the transition point. The degree correlation function 
p(d',d) of the network is known [1(3]. With the corre- 
lation function, one might approximate the network as 
the RC network and apply the above theory. However, 
it would yields that the mean cluster size diverges as 
S ~ \6- 5 C \- V0 with S c ~ 0.192 [H, which is not the 
case. 



Another example is the ERG [14j which is defined as 
the equilibrium ensemble of networks 21| with the Pois- 
son degree distribution. The model has an interesting 
feature that one can adjust the strength of the DD corre- 
lation with a parameter J (see Ref. [14| for details). The 
network is assortative (disassortative) with positive (neg- 
ative) values of J, and uncorrelated when J = 0. Numer- 
ical study of the percolation in Ref. showed that the 
disassortative network with J = — 1 belongs to the same 
universality class as the uncorrelated network with J = 0. 
On the contrary, the assortative network with J = 1 was 
found to display the similar type of percolation transition 
to the GN model. That is to say, the mean cluster size 
S is finite at the percolation threshold. 



A common feature of the GN models and the ERG 
model with positive J is an assortative DD correlation. 
However, it is evident that they cannot be described as 
the RC network. It suggests that there exists a genuine 
structural correlation that is responsible for the distinct 
universality class of the percolation transitions and can- 
not be captured only by the DD correlation. 

We propose that the correlation can be character- 
ized with loop structure. An /i-loop or ft,-cycle is de- 
fined as a self-avoiding path through h distinct nodes. 
Loops in complex networks have been studied in liter- 
atures [h, 0, ei, m s m, ef ' 



27, 25 




The clus- 
tering is related to the number of triangles, loops of 
size h = 3 0, Ell- Dynamic scaling behaviors on net- 
works are shown to depend on whether there are loops 
12311 . Loops of system sizes are also stud- 
io . A loop forms when two end nodes of a 
linear path are linked to each other. In an uncorrelated 
sparse network of size TV, such an event may occur by 
chance with the probability of the order of 0(1/N). Any 
structural correlation may be detected from deviations 
from uncorrelated network results. 

Loops in uncorrelated and RC networks: We 
first consider, as an uncorrelated network, the static 
model [lil with N nodes and KN edges. Each node 
i = 1,2, • ■ • , N is assigned to a selection probability 

u)i = (X^ii -AI ) with a parameter < fj, < 1. 

Edges are added successively by connecting two nodes 
chosen with the selection probability. This leads to a 
scale-free network with the degree exponent 7=1 + l//i, 
which is free from the DD degree correlation for /i < 1/2 
or 7 > 3 [13. 

In evaluating the number of loops, it is crucial to find 
the connecting probability between two nodes i and 
j. The mean number M^(h) of /i-loops is then given by 

M N (h) = T,i u - ,i h fiii 2 fi2i3 ■ ■ ■ fihii with the factor 
l/(2/i) compensating for overcounting. In the sum the in- 
dices should be mutually distinct, which will be neglected 
because it leads to a sub-leading order correction. 

The connecting probability is given by = 1 — 
e^KNu^j jjjj For ^ < or 7 > 3 ; one can use 

the approximation /y ~ 2KNujiLOj ljj . This yields that 



M, 



N 



(6) 



with / = 53i=i ■ I n the large N limit, it is given by 



/ = 



(1-2 M ) 



N , Inserting this into Eq. ([6]), we obtain 



M N (h) 



1 f2K(l-nf 



2h V 1 - 2/i 



(7) 



The result shows that Mat(/i) is finite for any finite h 
when 7 > 3. In the case when jj, > 1/2 or 7 < 3, Miy(h) 
grows algebraically with N [29j. 



We also consider the RC network specified with a de- 
gree correlation function p(d' ', d). The number of 3-loops 
or triangles in the RC network was studied in Ref. [l9(. 
One can generalize the analysis to find M^(h) for arbi- 
trary finite values of h. It is given by [20j | 



M N (h) 



(8) 



■ Ah 1=1 



where dh+i — d\. In the uncorrelated limit where 
p(d\d') = q d = dp d /{d), it reduces to M N (h) = (({d 2 ) - 
(d))/ (d)) h /(2h) which coincides with the one obtained for 
the Molloy-Reed model [13]. Analyzing this formula, we 
find that Mjv(/i) is also finite in the RC network unless 
the DD correlation is so perfect that p(d\d') is peaked at 
d = d! . Detailed analysis will be presented elsewhere [20j j . 

Loops in correlated networks: First we consider 
the GN model of Callaway et al. A node added at 
ith step will be labeled with the index i = 1, • • • ,N. The 
growth rule implies that the connecting probability /y 
for i > j is given by 



N 

/«=i- n 

n— i+1 



l - 



np2 



25 N 



N 



+ 0(N- 3 ) 



It is not symmetric in i and j, for which one needs some 
caution in evaluating M?j(K). 

For three nodes i\ > i 2 > is, a possible 3-loop config- 
uration is unique as shown in Fig. [Ha). So one has that 

m n (3) = Y, ll>i2 >i 3 /i^iWaiw In the lar s e N limit > 

it can be written as 



M N (3) = (25) 3 [V^]g{x 2 ){g{xx)Y 



(9) 



where J[D h x] = f 1/N dx h f Xh dx h ^ x ■ ■ ■ dxi and 
g(x) = (1 —x)/x. Note that the integral is dominated 
by the contribution near x n ~ 0. So, hereafter we will 
approximate the function g(x) — (1 — x)/x as g(x) = l/x, 
which does not change the leading order behavior. Eval- 
uating the integral, we obtain that 



Mjv(3) = (2<5) 3 lniV 



(10) 



For four nodes i\ > i 2 > «3 > «4, there are three 
distinct loop configurations as shown in Fig. [ljb). One 
can count the number of loops in each configuration sep- 
arately. Summing them up, one obtains that M/v(4) = 

(2Syj[V 4 K] (2g(x 3 )g(x 2 ) (g^)) 2 + (g(x 2 )f (g( Xl ) f 

It yields that 



M w (4) = ^(2<5) 4 lniV. 



(11) 



For h nodes i± > ■ ■ ■ > ih, the number of loop con- 
figurations is given by h\/(2h). A loop configuration C 
can be specified with a n (f3 n ), defined as the number of 






FIG. 1: Loop configurations of nodes ii > • ■ ■ > ih from 
bottom to top with h — 3 in (a) and h — 4 in (b). Also shown 
are the indices a n (see text). 



older (younger) partners of node i n (see Fig. [TJ . They 
are non-negative and satisfy a n + (3 n — 2 for all n. For 
later use, we also define the quantities A n = XlILi a i 
and B n = 53ILi Pi satisfying A n + B n = 2n. Then 
the number of /i-loops of configuration C is given by 
M e N {h) = (2S) h /[2? fc x] ULi {9(xn)) an - It yields that 



M c N (h) = (2S) h 



h-l 

n 



In N. 



(12) 



It was assumed that A n ^ n for n < h, which can be 
proved easily [30l ]. Summing over all C, we obtain that 



M N {h) = C h {28) h \nN , 



(13) 



where the combinatorial factor Ch takes the value C\ = 
1, C2 = 5/2, C3 = 7, etc [31!]. In contrast to the uncorre- 
lated and the RC networks, the total number of /i-loops 
in the GN model shows the logarithmic scaling. 

We also study the correlated scale- free networks. Doro- 
govtsev et al. ll| extended the GN model of Callaway et 
al. [HI by adopting the preferential selection rule; upon 
creating edges, a node with degree d is selected with the 
probability proportional to (d + a) with a parameter a. 
The resulting networks are scale-free with the degree ex- 
ponent 7 = 2 + a j (26). It is straightforward to show that 
the connecting probability fij cx i _ (">'~ 2 )/(')'~ 1 )j _1 /(T~ 1 ) 
for two nodes i > j. It is asymmetric as in the pre- 
vious case. So one can follow the same procedure to 
calculate M^(h). We also find the logarithmic scaling 
M^(h) ~ IniV for 7 > 3. Details will be presented else- 
where 

Finally we study numerically the loop statistics in 



the ERG model [1J|. An ensemble of ERG networks is 



generated using the Monte Carlo method explained in 
Ref. 14| and the number of loops is enumerated numeri- 
cally. Figure [2] presents the numerical results for Ma?(/i) 
in the networks with a disassortative (J = —1), the neu- 
tral (J = 0), and an assortative ( J = 1) DD correlation, 
respectively. We find that Mat(/j)'s are finite in the dis- 
assortative network as well as in the neutral network. It 
is noteworthy that both networks display the percolation 
transition in the same universality class [14[ . On the con- 
trary, we find a logarithmic scaling M]y(h) ~ (lnN)^ h 
for the assortative network. This suggests that the as- 
sortative DD correlation, although insufficient, may give 
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FIG. 2: Numbers of /i-loops in the ERP model with a dis- 
assortative (a), neutral (b), and assortative (c) degree-degree 
correlation. 



rise to the structural correlation that allows one to dis- 
tinguish them networks from others. We remark that 
the exponent ifth seems to depend on h in the assorta- 
tive ERG model while it is independent of h in the other 
correlated networks. Its reason and implication has not 
been understood yet. 

Summary: We have studied the number of loops in 
complex networks. Discussing the scaling behaviors of 
percolation critical phenomena, we have shown that the 
structural correlation is a relevant feature of networks. 
The scaling behavior in the number Mj^(h) of /i-loops 
for finite h with network size N is suggested as a signa- 
ture of the correlation. When the degree distribution is 
not too broad, we have found that M^(h) is finite and in- 
dependent of N in one class of networks and that Mn(K) 
displays the logarithmic scaling M n (h) ~ (\nN)'^ h in the 
other class of networks. The former includes the uncorre- 
cted and the RC networks. The percolation transitions 
in those networks are characterized with the divergent 
mean cluster size. The latter includes the GN models 
and the assortative ERG model. The display the perco- 
lation transitions with non-divergent mean cluster size. 
They have the assortative DD correlation commonly, al- 
though the assortativity by itself is not a sufficient con- 
dition. Our work shows that the structural correlation, 
reflected on the loop statistics, is important for the per- 
colation critical phenomena. Relevance to dynamical and 
equilibrium critical phenomena will be studied later. 
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To a given value of n < h, one can separate the edges into 
three nonempty sets; one for edges among {ii, • • • , i n }, 
another for edges among {i n +i, • • • , ih}, and the other for 
edges interconnecting them. An edge in the first set adds 
unity to A n and B n , while one in the second set does 
not contribute to them. One the other hand, a edge in 
the third set adds unity to A n but nothing to B n . Hence 
one finds that A„ > B n for all n. Since A n + B n = 2n, it 
proves that A n > n. The equality holds only when n = h. 
A closed form expression for Ch is not known. An exact 
enumeration study suggests that it grows exponentially 



as C h 



with 



4.0. 



